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ABSTRACT 

In  several  unpublished  manuscripts  written  from  1993  to  1995,  Michael  Stein,  C.L.  Winter,  and  Robert 
Tenney  introduced  a  multitarget  tracking  and  evidential-accumulation  concept  called  a  ’’Probability  Hy¬ 
pothesis  Surface”  (PHS).  A  PHS  is  the  graph  of  a  probability  distribution — ^the  Probability  Hypothesis 
Density  (PHD) — that,  when  integrated  over  a  region  in  target  state  space,  gives  the  expected  number 
of  targets  in  that  region.  The  PHD  is  uniquely  defined  by  this  property:  Any  other  density  function 
that  satisfies  it  must  be  the  PHD.  In  particular,  the  PHD  is  the  expected  value  of  the  point  process  of  a 
random  track-set — i.e.,  of  the  density  that,  when  integrated  over  a  region  in  state  space,  gives  the  exact 
(random)  number  of  targets  in  that  region.  In  1997  in  the  book  Mathematics  of  Data  Fusion  I  sketched 
the  elements  of  a  theoretical  foundation  for  PHS/PHD.  The  purpose  of  this  paper  is  to  publish  a  full 
accoimt  of  this  material  for  the  first  time.  We  show  that  the  PHD  is  a  first-order  moment  statistic  of  the 
random  multitarget  process  and,  consequently  that  from  a  computational  perspective  it  is  a  multitarget 
analog  of  single-target  constant-gain  Kalman  filters  such  as  the  a-^-7  filter.^ 

1.0  INTRODUCTION 

In  several  unpublished  manuscripts  written  during  the  period  from  1993  to  1995,  Michael  Stein  and 
C.L.  Winter  (Los  Alamos  National  Laboratory)  and  Robert  Tenney  (Alphatech  Corp.)  introduced  a 
multitarget  tracking  and  evidential-accumulation  concept  ’’called  a  ’’Probability  Hypothesis  Surface”  or 
”PHS”  [20,19].  A  PHS  is  the  graph  of  a  certain  unnormalized  probability  distribution — ^the  PHD  or 
Probability  Hypothesis  Density  — ^that  has  the  following  property:  Given  any  region  S  in 

target  state  space,  the  integral  D;fe|fe(x|Z^*^^)dx  is  the  expected  number  of  targets  contained  in  S.  This 
property  characterizes  the  PHD  uniquely.  That  is,  if  ^fc(x)  is  any  other  density  which  gives  the  expected 
number  of  targets  in  S  when  integrated  over  5,  then  it  is  (no  matter  how  imaginative  the  name  one  might 
assign  to  it)  nothing  else  but  the  PHD.  For,  since  ^fc(x)dx  =  bk\k{x\Z^^^)dx  for  all  measurable  S 
then  =  £^k\k  almost  everywhere.  In  particular,  the  PHD  is  the  expected  value  of  the  point  process  of 
a  random  track-set — i.e.,  of  the  density  that,  when  integrated  over  a  region  in  state  space,  gives  the  exact 
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(random)  number  of  targets  in  that  region  (see  Section  2.4).  Stein  and  Winter  devised  the  PHD  concept 
in  part  as  a  structure  for  a  new  inference  technique  called  Weak  Evidence  Accrual  (WEA)  that  exploits 
the  additive  rather  than  multiplicative  properties  of  Bayes’  rule.  We  will  not  discuss  this  aspect  of  the 
PHD  approach  in  great  detail  here  (see,  however,  Theorem  5).  Rather,  we  will  discuss  the  potential 
significance  of  the  PHD  as  a  computational  strategy  for  Bayes-optimal  multitarget  filtering — specifically, 
as  a  a  multitarget-tracking  analog  of  constant-gain  Kalman  filters  such  as  the  a-fi-j  filter  that  results  in 
additive  information-update  rules  of  the  WEA  type. 

In  1997  in  section  4.3.4,  pages  168-170  of  the  book  Mathematics  of  Data  Fusion  [7]  I  sketched 
the  elements  of  a  theoretical  foundation  for  PHD  based  on  the  ’’finite-set  statistics  (FISST)”  approach 
described  in  Chapters  2  and  4  through  8  of  that  book.  Because  of  page  limitations,  the  full  description 
of  this  work — specifically,  the  proofs  of  the  various  assertions — had  to  be  cut  fi-om  the  final  draft  of  the 
book.  The  purpose  of  this  paper  is  to  publish  this  material  in  the  open  literature  for  the  first  time,  as 
well  as  to  show  how  FISST  tools  such  as  the  set  derivative  can  be  used  to  develop  a  Bayes  filtering  scheme 
for  PHD’s.  Because  of  space  limitations,  it  is  not  possible  to  provide  a  summary  of  FISST  and  the  FISST 
calculus  in  this  paper.  See  the  monograph  An  Introduction  to  Multisource- Multitarget  Statistics  and  Its 
Applications  [11]  and  the  book  chapter  Multisensor- Multitarget  Statistics  [13]  for  more  details. 

1.1  APPROXIMATION  WITH  STATISTICAL  MOMENTS 

The  theoretical  starting-point  of  single-target  tracking  is  the  following  Bayesian  discrete-time  recursive 
nonlinear  filtering  equations  (see  [8],  [17],  [2,  pp.  373-377],  and  [9,  p.  174]: 


^MAP 

^fc+l|fc+l 


J /fc+i|fc(xfe+i|xfc)  /fc|fc(xfc|Z*')dxfc 
/(Zfc+i|Xfc+i)  /fc4.i|fc(Xfc+i|Z'') 


argsup/fc+i|fc+i(x|Z''+^),  ^k+^k+i=  y^x-/fc+i|fc+i(x|Z''+*)dx 


where 

(1)  Xk  is  the  target  state  variable  at  time-step  k  and  Zk  is  the  observed  measurement  at  time-step  k] 

(2)  fk\k{^k\^^)  is  the  Bayes  posterior  distribution  conditioned  on  the  data-stream  =  {zi, 

(3)  /(z|x)  is  the  sensor  likelihood  function; 

(4)  fk^i\k{^k-\-i\^k)  is  the  target  Markov  transition  density  that  models  between-measurements  target 
motion; 

(5)  fk’^i\k{^k-^i\^^)  is  the  time-prediction  of  the  posterior  fk\k{^k\2^)  to  time-step  /c  -h  1; 

(6)  {zk^i\Z^)  =  J  /(zfc-f-ilyjfe+i)  /fc+i|fe(yfc+i|^^)dyfc+i  is  the  Bayes  normalization  constant;  and 

(7)  Bayes-optimal  maximum  a  posteriori  (MAP)  and  expected  a  posteriori 
(EAP)  state  estimators,  respectively. 

In  all  of  these  formulas,  data  and  state  vectors  have  the  form  y  =  (l/i, where 
are  continuous  variables  and  are  discrete  variables,  and  we  denote  the  space  of  all 

state  vectors  as  5.  Integrals  of  functions  of  such  variables  involve  both  summations  and  continuous 
integrals.  Since  state  vectors  x  may  have  discrete  components,  /(z|x)  can  encompass  different 
measurement  models  for  different  target  types  and  /fc4.i|fc(xfc+i|xfc)  can  encompass  different  motion 


modeb  for  different  target  types.  If  the  measurement  and  motion  modeb  are  linear  and  Gaussian,  the 
above  equations  reduce  to  the  Kalman  time- update  and  information-update  equations,  respectively  [8]. 

All  relevant  information  about  the  state-vector  x  of  the  target  at  time-step  k  is  contained  in 
the  Bayes  posterior  density  function  /fe|fc(x|Z^).  Updating  it  to  a  new  posterior 
using  the  Bayes  filtering  equations  usually  presents  a  formidable  computational  challenge.  If  signal-to- 
nobe  ratio  (SNR)  is  high  enough  that  all  time-evolving  posteriors  are  not  too  complex,  however,  one  can 
compress  the  posterior  into  a  finite  number  of  summary  statistics  and  propagate  these  statistics  in  time 
instead  of  the  posterior  itself.  The  two  most  familiar  summary  statistics  are  the  first-moment  vector  and 
second-moment  matrix 

Xfcifc  =  Jx  /fc|fc(x|Z'')dx,  Ofcifc  =  J fkik{x\Z'‘)dx 

where  denotes  matrix  tranpose.  If  SNR  b  so  high  that  the  second-order  and  higher  moments  can 
be  neglected  then  the  first  moment  b  approximately  a  sufficient  statistic,  /fc|fc(x|Z^)  =  fk\k{^\^k\k)-  In 
this  case  we  can  propagate  Xjfe|;fe  in  time  instead  of  the  full  dbtribution  using  a  constant-gain  Kalman 
filter  such  as  the  a-/3-7  filter — i.e.,  using  completely  linear  equations.  Otherwise,  if  SNR  is  such  that 
the  higher-order  moments  can  be  neglected  but  covariance  cannot,  then  x^j^  and  Qk\k  are  approximate 

sufficient  statbtics,  /fc|fc(x|Z^)  =  fk\k{^\^k\k^  Qk\k)j  and  we  can  propagate  Xk\k  and  Qk\k  using  a  Kalman 
filter. 

1.2  STATISTICAL  MOMENTS  FOR  MULTITARGET  PROBLEMS 

Stein  and  Winter’s  PHD  approach  can  be  thought  of  as,  in  part,  an  attempt  to  extend  the  reasoning  just 
outlined  to  multitarget  tracking  problems.  In  such  problems  the  optimal  approach  would  be  to  write 
down  the  following  multisensor,  multitarget  analogs  of  the  Bayes  nonlinear  filtering  equations: 

fk+i\k{Xk+i\Z^’‘  J  fk+i\k{Xk+i\Xk)fk\k{Xk\Z^’^^)6Xk 

fk+l\h+l{.Xk+\\Z^*‘'^^^  CC  f{Zk+l\Xk+l)  fk+l\k{Xk+i\Z^’^^) 

r\^\ 

In  thb  case, 

(1')  Xk  b  the  multitarget  state^  i.e.  the  set  of  unknown  target  states  (which  are  abo  of  unknown  number) 
and  Zk  is  the  set  of  all  measurements  collected  off  of  all  targets  at  time-step  k] 

(2')  fk\k{Xk\Z^^^)  is  a  multitarget  posterior  density  at  time-set  k  conditioned  on  the  time-stream  Z^^^  = 
{Zi,  ...,Z^}; 

(3')  f{Z\X)  b  the  multisensor,  multitarget  likelihood  Junction  that  describes  the  likelihood  of  observing 
the  observation-set  Z  given  that  the  multitarget  system  has  multitarget  state-set  X] 

(4')  fk^i^k{Xk-\-i  \Xk)  is  the  multitarget  Markov  transition  density  that  describes  the  likelihood  that  the 
targets  will  have  state-set  at  time-step  A:  -h  1  given  that  they  had  state-set  at  time-step  k\ 

(5')  is  the  time-prediction  of  the  multitarget  posterior  fk\k{^k\Z^^^)  to  time-step 

A:  -l-  1; 

(6')  /(Z;b+i|Z(*^^)  =  J /(Zfc+i|y)  fk^\\k{y\Z^^^)SY  is  the  Bayes  normalization  constant;  and 


(7')  iffc+i  ^  ^  multitarget  analog  of  the  MAP  estimator  (whose  direct  multitarget  extension  is 
undefined) . 

The  multitarget  filtering  equations  cannot  be  used  in  the  blind  fashion  just  indicated  but,  rather, 
require  the  tools  of  finite-set  statistics  (FISST).  [7,11]  A  short  history  of  multitarget  Bayes  filtering  can 
be  found  in  Section  1.  4  below. 

In  more  detail,  a  multitarget  state-set  X  has  the  form 

A  =  0,  {x},  {Xi,X2}  {xi  x^ 

where  X  =  0  indicates  that  no  target  is  present,  X  =  {x}  indicates  that  one  target  with  state  x  is 
present,  X  =  {xi,X2}  indicates  that  two  targets  with  states  Xi,X2  are  present,  and  so  on.  The  Bayes 
multitarget  posterior  f^^{X\Z^^^)  has  the  form 

/fc|fc(0|Z^^^)  =  posterior  likelihood  that  no  targets  are  present 

/A;|fc({x}l^^*^^)  =  post.  like,  of  one  target  with  state  X 

/ifejfc({xi,  =  post.  like,  of  n  targets  with  states  Xi,  .,x-^ 

It  must  sum  to  one  over  all  multitarget  states.  That  is,  let 

=  fk\kmz(%  fk\k{i\Z^'‘^)  =  ^  y  .,Xi}\Z^’^^)dxi  dxi 

be  the  marginal  posterior  probability  that  there  are  i  =  0, 1, 2, ...  targets  present.  Then  for  f^k{X\Z^) 
to  be  a  multitarget  probability  density  the  following  quantity,  called  a  set  integral^  must  sum  to  one: 

1  =  J  =  /fc|fc(0|z(*=))  +  +  A|fc(2|zW)  -I-  4-  /fc|fc(n|zW)  + 

Given  the  formidable  computational  complexity  of  the  single-target  Bayes  nonlinear  filtering  equa¬ 
tions,  it  should  be  clear  that  this  complexity  will  be  magnified  many-fold  in  multitarget  problems.  Drastic 
but  intelligent  approximation  strategies  are  required.  In  a  recent  paper  [14]  I  proposed  one  computational 
strategy  based  on  a  multitarget  analog  of  the  familiar  Gaussian  approximation.  In  this  paper  I  exploit 
a  different  computational  analogy:  propagating  multitarget  analogs  of  first-order  (and/or  second-order) 
moments  of  the  time-evolving  random  track-set.  I  use  the  multitarget  moments  outlined  in  Section  4.3.4, 
pages  168-170  of  Mathematics  of  Data  Fusion.  [7]  Let  x  be  a  fixed  target  state.  Then  for  any  i>\ 
the  marginal-density  value 

^  J  fk\k{{x.,Xi,...,Xi}\Z^’°^)dxi---dxi 

is  the  total  posterior  likelihood  that  the  multitarget  system  has  i  +  1  targets  and  that  one  of  these  targets 
has  state  x.  Consequently,  for  each  x  the  marginal-density  value 

=  I  =  J  fk\k{{x,x„ 


Xi}\Z^’^'>)dxx  dxi 


is  the  total  posterior  likelihood  that  the  multitarget  system  contains  a  target  that  has  state  x.  Consequently, 
will  tend  to  have  maxima  approximately  at  the  locations  of  the  targets. 

It  can  be  shown  (see  Section  2.2)  that  Dkij^{{x}\Z^^^)dx  is  the  expected  number  of  targets  in  S — 
which  means  that  is  the  same  thing  as  the  Stein- Winter  PHD  Dfc|fc(x|Z^^^).  Intuitively 

speaking,  just  as  the  value  of  the  probability  density  function  /x(x)  of  a  continuous  random  vector  X 
provides  a  means  of  describing  the  zero-probability  event  Pr(X  =  x),  so  the  PHD  Dr(x)  of  a  finite 
random  track-set  T  provides  a  means  of  describing  the  zero-probability  event  Pr(x  €  P)  (see  Section 
2.5).  Also,  the  state  vector  x  in  any  PHD  should  be  interpreted  as  an  accumulated  (or  compressed) 
multitarget  state  rather  than  as  a  conventional  single-target  state. 

Last  but  not  least,  from  the  point  of  view  of  point-process  theory^  the  PHD  is  the  same  thing  as  the 
expectation  density  or  first  factorial-moment  density  of  the  random  track-set  T  at  time-step  k  (see  Section 
2.4).  This  means,  in  particular,  that  Dk\j^{x\Z^^^)  is  a  type  of  least-squares  best-fit  approximation  of 
the  multitarget  posterior  fk\k{^\Z^^^)  by  a  single-target  density  function  (see  Section  2.4).  In  this  sense 
the  PHD  is  a  multitarget  analog  of  the  single-target  first-order  moment  x^i^.  If  the  multitarget  sensing 
situation  is  benign  enough — meaning  that  signal-to-clutter  ratio  (SCR)  as  well  as  SNR  is  large,  then  the 
PHD  will  be  an  approximate  sufficient  statistic:  fk\k{^\^^^^)  —  fk\k{^\^k\k)'  principle,  therefore, 
it  should  be  possible  and  desirable  to  propagate  Dk\k{^\^^^^)  instead  of  the  full  multitarget  posterior 
fk\k{^\^^^^)j  using  suitable  analogs  of  the  single-target  Bayes  recursive  filtering  equations  of  Section  1.1. 
Real-time  multitarget  tracking  would  then  be,  from  a  computational  point  of  view,  reduced  to  the  (still 
very  difficult)  problem  of  implementing  a  real-time  single-target  nonlinear  filter  capable  of  modeling  the 
rather  complex  time-evolution  of  the  PHD. 

That  is,  what  we  would  like  to  be  able  to  do  is  to  establish  the  existence  of  a  diagram  of  the  form 


fk\k  prediction 


.multitarget  Bayes’  rule 


i 


f fc+1 1 


D: 


'k\k 


^PHD  prediction??  _^PHD  Bayes  rule?? 


where:  (1)  the  top  row  portrays  the  time-sequence  of  the  multitarget  Bayes  filtering  equations;  (2) 
the  downward-pointing  arrows  indicate  the  replacement  of  multitarget  posteriors  by  their  corresponding 
PHD’s;  and  (3)  the  bottom  row  portrays  a  recursive  time-sequence  of  filtering  operations  on  PHD’s  that 
always  yields  the  result  that  one  would  get  if  one  computed  multitarget  posteriors  using  only  the  top  row 
and  then  transformed  them  into  their  PHD  approximations. 

Our  goal,  then,  is  to  fill  in  the  "question  marks”  in  the  bottom  row  of  the  diagram.  We  will  show 
(Section  3.1)  that  the  PHD  Dk\k{^\Z^^^)  can,  under  certain  assumptions,  be  time-propagated  between 
measurements  to  a  new  PHD  Dfc_f_i|fc(x|Z^^^)  using  a  suitable  extension  of  the  first  of  the  two  single-target 
Bayes  filtering  equations  of  Section  1.1,  one  that  accounts  for  multitarget  behaviors  such  as  appearance 
and  disappearance  of  targets.  We  will  also  show  (Section  3.2)  that,  given  a  new  multisensor-multitarget 
observation-set  Zfc+i,  the  PHD  Z>fc_j.i|fc(x|Z^^^)  can  be  updated  to  a  new  PHD 
using  one  of  two  different  approximate  methods.  First,  a  relatively  simple  PHD  version  of  Bayes’ 
rule;  and  second,  a  more  complicated  but  "additive"  update  transformation  based  on  the  concept  of  the 
"approximate  multitarget  posterior"  of  a  PHD. 

1.3  RELATED  APPROACHES 

The  idea  of  using  a  single-target  density  function  gk\k  (or?  more  commonly,  probability  contours  of 
its  graph)  as  a  basis  for  multitarget  tracking  is  a  relatively  common  one.  Examples  are  the  Naval  Re- 


search  Laboratory’s  TABS  (Tactical  Antisubmarine-warfare  Battle-management  System)  tracker,  Metron 
Corp.’s  Nodestar  tracker,  [22]  and  others  [23].  The  work  described  in  this  paper  differs  from  earlier 
work  in  its  systematic  and  theoretically  rigorous  treatment  of  the  ’’correct”  ^  ^  PHD:  i.e.,  as  a 

first-order  factorial-moment  statistic  of  the  multitarget  system. 

1,4  A  SHORT  HISTORY  OF  BAYES  MULTITARGET  FILTERING 

The  concept  of  multitarget  Bayesian  nonlinear  filtering  (Section  1.3)  is  a  relatively  new  one.  If  one 
assumes  that  the  number  of  targets  is  known  beforehand,  the  earliest  exposition  appears  to  be  due  to 
Washburn  [24]  in  1987,  using  a  point  process  formulation  (see  Section  2.4  for  a  summary  of  point  process 
theory). 


Date 

Author(s) 

Theoretical  Basis 

1991 

Miller  et.  al.  [10] 

’Mump  Diffu.sic3n” 

Stochastic  PDEs 

1994 

Bethel  and  Paras  [3] 

Discrete  filtering 

1994 

Mahler  [11] 

’’Finite-Set  Stati-stios” 

FhSST 

1996 

Stone  et.  al.  [21] 

’’Unified  Data  Fusion” 

Heuristic 

1996 

Mahler-Kastella  [15] 

’’Joint  Multitarget  Probabilities” 

FISST 

1997 

Portenko  et.  al.  [16] 

Point  processes 

The  table  summarizes  the  history  of  the  approach  when  the  number  n  of  targets  is  not  known  and  must 
be  determined  along  with  the  individual  target  states.  The  earliest  work  in  this  case  appears  to  be  due 
to  Miller,  O’Sullivan,  Srivistava,  et.  al.  [10].  Their  very  sophisticated  approach  requires  solution  of 
stochastic  diffusion  equations  on  non-Euclidean  manifolds.  It  is  also  apparently  the  only  approach  to 
deal  with  continuous  evolution  of  the  multitarget  state.  (All  other  approaches  listed  in  the  table  assume 
discrete  state-evolution.)  Mahler  was  apparently  the  first  to  systematically  deal  with  the  general  discrete 
state-evolution  case  (Bethel  and  Paras  assume  discrete  observation  and  state  variables).  Kastella’s  ’’joint 
multitarget  probabilities  (JMP),”  introduced  at  Lockheed  Martin  in  1996,  are  a  renaming  of  a  number 
of  early  core  FISST  concepts  (set  integrals,  multitarget  information  metrics,  multitarget  posteriors,  joint 
multitarget  state  estimators,  etc.)  devised  two  years  earlier  [15],  A  ”  JMP”  itself  is  just  a  discretization 
of  a  FISST  (or,  for  that  matter,  a  Jump-Diffusion)  multitarget  posterior: 

n\  /jMp(Xi  'X^\Z)  =  /f/55t({Xi  ,Xn}\Z)  =  n!  fFISST^X-l,  ,Xn\Z) 

Stone  et.  al.  have  provided  a  valuable  contribution  by  clarifying  the  relationship  between  multitarget 
Bayes  filtering  and  multi- hypothesis  correlation.  Nevertheless,  their  approach  is,  with  regrets,  described 
as  ’’heuristic”  in  the  table  for  the  reasons  summarized  in  [11,  pp,  91-93). 

1.5  ORGANIZATION  OF  THE  PAPER 

The  paper  is  organized  as  follows.  Basic  concepts  are  covered  in  Section  2:  multitarget  moment  densities 
(Section  2.1);  the  properties  of  multitarget  moments  and  the  PHD,  including  their  direct  construction 
using  set  derivatives  (Section  2.2);  examples  of  PHD’s  (Section  2.3);  identification  of  the  PHD  as  a 
first-order  statistical  moment  (Section  2.4);  and  PHD’s  in  the  discrete-state  case  (Section  2.5).  Section 
3  is  devoted  to  the  Bayes  filtering  equations  for  the  PHD:  the  time-update  equation  (Section  3.1),  the 


approximate  Bayes-update  equation  (Section  3.2) ,  and  prior  PHD’s  (Section  3.3)  Proofs  of  the  theorems 
are  relegated  to  Section  4.  Conclusions  may  be  foimd  in  Section  5. 


2.0  MULTITARGET  MOMENT  DENSITIES  AND  THE  PHD 


The  purpose  of  this  section  is  to:  (1)  formally  define  the  concept  of  a  multitarget  moment  density  function 
Djc\jc{X\Z^^^)]  (2)  provide  a  general  procedure  for  constructing  it  using  the  FISST  set  derivative;  and 
(3)  show  that  the  first-order  multitarget  moment  density  Z>fc|fc({x}|Z(^^)  is  the  same  thing  as  the  Stein- 
Winter  Probability  Hypothesis  Density  I  define  multitarget  moment  densities  in  Section 

2.1,  describe  their  major  properties  in  Section  2.2,  provide  examples  of  PHD’s  in  Section  2.3,  show  that 
the  PHD  is  a  first-order  statistical  moment  in  Section  2.4  and,  in  Section  2.5,  consider  PHD’s  in  the 
special  case  when  the  target  state-space  is  discrete. 

2.1  MULTITARGET  MOMENT  DENSITIES 

I  begin  with  a  definition: 

Definition  1  (Multitarget  Moment  Densities)  [7,  p.  169];  The  multitarget  moment  density 
is 

''  i=0 

Notice  that  =  1.  If  the  number  \X\  of  elements  in  X  is  restricted  to  n  then  I  will 

call  the  function  Dk\k{{xi,  ...^Xn\\Z^^"^)  the  n'th  multitarget  moment  density.  Also  notice  that  the 
set  integral  is  well-defined  in  the  sense  that  fk\k{X  U  W\Z^^^)  always  has  the  same  units  as  X  and  so 
there  are  no  units-mismatch  problems  of  the  kind  described  in  [11,  p.  39].  For  any  multitarget  state 
X  =  {xi,...,x„},  Dfe|fc({xi,...,x„}|Z(*=))  is  the  marginal-posterior  likelihood  that,  regardless  of  how 

many  targets  there  may  be  in  the  multitarget  system,  exactly  n  of  them  have  states  Xi, ...,  Xn- 

2.2  COMPUTING  PHD’S  USING  THE  SET  DERIVATIVE 


In  this  section  I  show  how  to  construct  multitarget  moment  densities  directly  from  the  random  multitarget 
track-set  using  the  FISST  set  derivative  (Theorem  1)  and  use  this  fact  to  show  that  the  PHD  and  the 
first-order  multitarget  moment  density  are  the  same  thing  (Theorem  2).  I  begin  by  demonstrating  the 
first  result  (which  will  also  allow  us  to  compute  between- measurement  laws  of  motion  for  PHD’s  directly 
from  multitarget  motion  models,  see  Section  3.1).  Let  be  the  random  set  of  current  tracks  at  time- 
step  k,  meaning  that  fk\k{X\Z^^^)  is  the  multitarget  density  corresponding  to  the  belief-mass  function 
=  Pr(rfc  C  S).  Then: 


Theorem  1  (Computing  Multitarget  Moments  Using  the  Set  Derivative)  [7,  p.  169]:  Let 

Dk\k{X\Z^^^)  be  the  multitarget  moment  density  corresponding  to  the  multitarget  posterior  fk\k{X\Z^^^). 
Then: 

for  all  finite  subsets  X  of  (single-target)  state  space  <S. 


The  proof  of  this  assertion  can  be  found  in  Section  4.1.  It  should  be  compared  to  the  similar  formula 
for  constructing  multitarget  posterior  densities  [11,  pp.  30-31] : 


=  ^(0|ZW) 


As  already  noted,  because  of  Theorem  1  we  can  show  that  the  first-order  moment  density  and  the 
PHD  are  the  same  thing.  If  PI  S  is  the  set  of  tracks  contained  in  S  then  jF^  fl  5|  is  the  number 
of  tracks  in  S  and  E[|Fa;  PI  5|]  is  the  expected  number  of  tracks  in  S.  Then: 

Theorem  2  (The  PHD  is  the  First  Multitarget  Moment  Density)  [7,  p.  169]:  For  any 

measurable  subset  S  CS  of  state-vectors, 

E[|rfcnS|]=  /  £>fc|fc({x}|zW)(ix 
Js 

Consequently,  the  first  multitarget  moment  density  and  the  Probability  Hypothesis  Density  (PHD)  are 
equal  almost  everyhere:  Dk\k{^\Z^^^)  = 

See  Section  4,2  for  the  proof. 

2,3  EXAMPLES  OF  PHD’s 

2.3.1  EXAMPLE  1:  INFORMATION  LOSS  IN  PHD’s.  Information  is  lost  when  we 

compress  a  single-target  posterior  density  into  its  first  moment  Xk\k  =  /fc|fc(x|Z*^)dx. 

Likewise,  information  is  lost  when  we  compress  a  multitarget  posterior  into  its  PHD 

For  example,  suppose  that  we  are  trying  to  determine  the  locations  of  two  targets  on  the 
real-number  line  based  on  a  single  sensor-scan  Zi  =  {zi^Z2}^  Suppose  that  the  multitarget  posterior 
has  the  form 

fl\l{{xi,X2}\Z^^^)  =  N„2{Xi  -  Zi)Nir2{X2  -  Zi)  +  N^2{X2  -  Zi)N„2{Xi  -  Zi) 

where  Nff2(x)  =  {■\/^a)~^  exp{—x^/2cr^)  is  the  normal  distribution  with  variance  The  correspond- 
ing  PHD  and  second  moment  are,  respectively, 

Di\i{x\Z^^'>)  =  J  fi\i{{x,y}\Z^^'>)dy  =  N„2{x-zi)  +  N„2{x-Zi) 

£>i|i(K,X2}|Z(‘))  =  /i|i({xi,X2}|Z(i)) 

Note  that  J  Di^i{x\Z^^^)dx  =  2,  so  that  the  expected  number  of  targets  is  two.  In  general,  Di\i  is 
bimodal.  However,  it  is  easily  shown  that  it  is  unimodal  with  maximal  value  at  x  —  ^{zi-\-Z2)  whenever 
\zi  —  Z2\  <  2a,  The  multitarget  posterior  /iji({xi,X2}|.^^^^),  on  the  other  hand,  is  always  unimodal  (as 
a  function  of  a  set  variable)  but  fails  to  distinguish  two  distinct  targets  when  \zi  —  Z2\  <  y/2a.  In  this 
case  its  unique  maximal  value  is  located  at  Xi  =  X2  =  ^(2:1  -h  2:2).  So,  for  data  separations  in  the  range 
^/2a  <  12:1—2:21  <  2a  the  multitarget  posterior  is  capable  of  separating  two  targets  whereas  the  PHD 
is  not.  This  indicates,  not  unexpectedly,  that  a  PHD-based  multitarget  tracker  will  experience  more 
difficulty  with  closely-spaced  targets  than  would  a  tracker  based  on  the  full  multitarget  nonlinear  filtering 
equations  of  Section  1.2. 

2.3.2  EXAMPLE  2:  CLUTTER  AND  PHD’s.  Extend  the  previous  example  by  including 
the  effects  of  clutter.  Assume  that  a  single  sensor  observes  two  targets  with  no  missed  detections  but 
with  false  alarms  governed  by  the  independent  clutter  process  k{Z),  If  we  collect  one  scan  Zi  =  {2:1, 2:2} 
consisting  of  two  distinct  observations  2:1 , 2:2  and  assume  a  uniform  prior  then  the  following  multitarget 
posterior  is  the  result: 

/i|i({xi,X2}|^^^^)  =  .  ^  {N„2{Xi -zi)N „2{xi  -  Zi)  +  N„2{xi- z-i)N„2(xi  -  Zi)) 

^  {zi,z2y^z 


with  fi\i{X\Z^^^)  =  0  whenever  X  does  not  contain  exactly  two  elements,  The  corresponding  PHD 

zGZ 

Notice  that  f  bi\i{x\Z^^^)dx  =  2.  As  a  function  of  the  set  {xi,X2},  the  multitarget  posterior  has 
Cm, 2  =  ^rn{m  - 1)  peaks.  Each  peak  corresponds  to  a  different  hypothesis  regarding  which  two-element 
subset  of  Z  are  target-generated  reports  rather  than  false  alarms.  Since  the  PHD  condenses  multitarget 
information  into  a  density  on  single-target  state  space,  it  has  at  most  m  peaks.  If  targets  are  sufficiently 
separated,  each  peak  corresponds  to  a  different  hypothesis  about  the  location  of  the  individual  targets. 

2.4  THE  PHD  IS  A  FIRST-ORDER  MOMENT  OF  A  RANDOM  SET  [7,  p.  169] 

The  purpose  of  this  section  is  to:  (1)  show  that  the  multitarget  moment  density  Dk\k{X\Z^’^^)  is 
identical  to  the  ’’factorial-moment  densities”  of  point  process  theory;  (2)  conclude  that  the  PHD  is  a  first 
mom<^nt  of  the  multitarget  system;  and  (3)  provide  an  inversion  formula  for  transforming  multitarget 
moment  densities  into  multitarget  posteriors  fk\k{X\Z^’°^).  All  unreferenced  page  numbers 

in  this  section  refer  to  the  textbook  by  Daly  Vere- Jones  [4].  Let  Pj,  be  a  random  track-set.  Then 
either  the  random  integer-valued  measure 

iV;fe(5|E'W)  =  |rfcn5|=  /  5r.(  x)dx  =  no.  of  tracks  in  contained  in  region  S 

Js 

or  its  random  density  function  ^r^Cx)  =  Z)wGrte  ^w(x)  is  called  a  multi- dimensional  point  process.  Point 
process  theory  is  a  special  case  of  random  set  theory  and,  in  fact,  multi-dimensional  point  processes  seem 
to  have  been  originally  defined  as  random  sets  rather  than  as  random  measures  [1].  The  statistical 
behavior  of  Nk{S\Z^^^) — or,  equivalently,  of  Tk  and  6r^  (x) — is  characterized  by  its  family  jfc,t(xi,  ...,Xi) 
of  Janossy  densities  (pp.  122-123).  Janossy  densities  are  completely  symmetric  in  all  arguments;  vanish 
whenever  x^  =  Xj  for  some  i^  j  (p.  134,  Prop.  5.4.IV);  and  are  jointly  normalized  in  the  sense  that 
12^0  ^Jfc,i(xi,...,Xi)  =  1.  The  multitarget  posterior  density  fk\k{X\Z^^^)  of  Tk  is  the  same  thing  as 
the  family  of  Janossy  densities  jk,i  of  Nk{S\Z^^^): 

jk,iixi, ...,  Xi)  =  /fclfc({xi,  ..., Xi}|Z('')) 

for  all  distinct  xi,...,Xi  (In  like  manner,  the  following  quantities  are  just  diflferent  notations  for  a 
multitarget  posterior: 

A|fc({xi,...,x„}|zW)  =  fkik{S..+...  +  S^jZ^’^^)  = 

where  the  second  and  third  quantities  denote,  respectively,  posterior  probability  distributions  over  all 
point-process  densities  Sx  or  all  point-process  measures  Nx-) 

The  expected  value 

=  E[Arfe|fc(5|Z(^))]  =  EilPfc  n51]  =  jT  E[«r.(x)]dx 

is  called  the  expectation  measure  or  first  factorial-moment  measure  of  Nk{S\Z^^^)  (p.  130)  Its  density 

m^,p](x)  =  E[6r.(x)]  =  J 6x{x)  fkik{X\Z^'’^)SX 


is  called  the  expectation  density  or  first  factorial-moment  density.  Higher-order  factorial-moment  densi¬ 
ties  mjfe  can  be  defined  (pp.  130,  112)  and,  from  Definition  1  (Section  2.1),  it  follows  that 

(p.  133,  equation  5.4.11): 

Dk\k{{xi,  =  mk,[j]{xi,...,Xj) 

for  distinct  Xi, Xj  €  S.  That  is,  the  are  statistical  moments  of  the  random  set  Tk  and 

the  PHD  is  the  first  factorial-moment  density.  Moreover,  the  multitarget  posterior  density 
can  be  recovered  from  the  multitarget  moment  density  Dk\k{X\Z^^^)  via  the  following  set  integral  (p. 
133,  equation  5.4.12): 

Theorem  3:  (Inversion  formula  for  multitarget  moment  densities)  [7,  p.  169]: 

=  I  U  W\Z^'^^)8W 

This  formula  confirms  the  obvious  fact  that  the  multitarget  posterior  cannot  be  completely  described 
by  any  one  multitarget  moment  density  D]^\)^{X\Z^^^)  and,  in  particular,  by  the  PHD  Dk\}^{x\Z^^^). 
Rather,  all  multitarget  moment  densities  are  required  to  completely  recover  the  information  contained  in 
the  multitarget  posterior. 

2.5  THE  PHD  IN  THE  DISCRETE-STATE  CASE  [7,  p.  169] 

Suppose  that  (single-target)  state  space  5  is  a  finite  set  of  target-state  cells  x.  Let  F  be  the  randomly 
varying  track-set  and  note  that  fk\k{X\Z^^^)  =  Pr(r  =  X).  Then: 

=  E  =  E  =  ^)  =  E  ^  ^  ^ 

X3x  X3x  X 

This  result  shows  that,  in  the  discrete  case,  the  PHD  of  a  random  track-set  F  is  the  same  thing  as  I.R. 
Goodman’s  one-point  covering  function  //^(x)  =  Pr(x  G  F)  =  D(x)  of  the  random  set  F.  [5,6]  The 
existence  of  this  relationship  is  the  reason  why,  in  Mathematics  of  Data  Fusion^  I  used  the  term  ’’global 
covering  densities”  for  what  in  this  paper  I  call  ’’multitarget  moment  densities.”  It  also  shows  that  in  the 
continuous  case,  the  PHD  provides  a  means  of  representing  the  zaro-probability  event  Pr(x  €  F)  in  much 
the  same  way  that  the  density  /x(x)  of  a  continuous  random  vector  provides  a  means  of  representing 
the  zero-probability  event  Pr(X  =  x).  Furthermore,  it  is  easy  to  show  that 

E Dk\k{x\z^'°^)  =  EEp(^  e  X, r  =  X)  =  E  ^  ^))  Pr(x)  =  E[  |r  n  x| ] 

X  X  X  X  \  X  / 

In  the  fuzzy  logic  literature,  the  sum  ^^./^(x)  is  called  the  ’’sigma-count”  of  the  fuzzy  membership 
function  and  is  interpreted  as  the  ’’number  of  elements”  in  the  fuzzy  set  corresponding  to  //. 

3.0  RECURSIVE  BAYES  FILTERING  OF  THE  PHD 

In  this  section  we  derive  recursive  filtering  equations  for  the  PHD  analogous  to  the  single-target  Bayes 
recursive  filtering  equations  of  Section  1.1.  These  equations  include  between-measurements  time-update 
equations  (see  Section  3.1)  and  an  approximate  Bayes  information-update  equation  (see  Section  3.2). 
The  construction  of  prior  PHD’s  is  discussed  in  Section  3.3. 

3.1  TIME-UPDATE  OF  THE  PHD 


The  between-measurements  time-evolution  of  a  single-target  posterior  is  described  by  the  first  of  the 
single-target  Bayes  filtering  equations  of  Section  1.1.  The  purpose  of  this  section  is  to  show  how  to 
construct  similar  laws  of  motion  for  PHD’s.  Let  fk^i\k{Y\X)  be  the  multitarget  Markov  transition 
density  that  corresponds  to  some  multitarget  motion  model  [11,  pp.  21-23]  and  let 
be  the  time-predicted  multitarget  posterior  as  computed  in  the  first  of  the  multitarget  Bayes  filtering 
equations  of  Section  1.2.  The  PHD’s  corresponding  to  these  two  multitarget  posteriors  are 

=  J  /fc|fc({x}  U  Dfc+i|fc(x|Z«)  =  J  /fc+i|fc({x}  U  W\Z^^^)8W 

It  might  appear  that  the  law  of  motion  relating  and  is  nothing  more 

complicated  than  the  following  simple  analog  of  the  first  Bayes  filtering  equation: 

^fc+iifc(y|^*’)  =  j /fc+i|fe(y|x)  Dfe|fc(x|z*)dx 

In  actuality,  this  evolution  law  describes  only  multitarget  motion  in  which  target  motions  are  independent 
and  the  number  of  targets  does  not  change  (Corollary  2  below).  We  begin,  therefore,  with  a  general 
result  that  encompasses  quite  general  multitarget  motion  models  that  account  for  ” birth’’  and  ’’death” 
of  targets — i.e.,  targets  that  enter  and  leave  the  scenario  for  whatever  reason.  The  result  itself  is  less 
important  than  its  method  of  proof  (see  Section  4.3). 

Theorem  4  (Laws  of  Motion  for  PHD’s):  Suppose  that  between  time-step  k  and  time-step 
A: -hi,  the  following  multitarget  motion  model  is  assumed:  (1)  target  motions  are  statistically  independent; 
(2)  the  Markov  motion  model  for  single  targets  is  fk+i\k{y\^)l  (3)  probability  that  any  individual 
target  will  not  ’’die”  (i.e.,  not  vanish  from  the  scenario)  if  it  has  state  x  at  time-step  k  is  dk^i\k{^)i 
and  (4)  each  target  with  state  x  at  time-step  k  generates,  independently  of  all  other  targets,  new 
’’birth”  targets  in  a  fashion  governed  by  a  FISST  multitarget  density  &fcH-iiib(5^|x).  Then: 

Dk+i\k{y\Z^'‘^)  =  j  (4+i|fc(x)/fc+i|fc(y|x) +6fc+i|fc(y|x)^  bk\k{^\z^’°^)(hi 

where  &jfe-(-i|fc(y|x)  denotes  the  PHD  of  the  multitarget  density  6fc_|.ijfc(X|x). 

The  proof  can  be  found  in  Section  4.3.  Let  Nk\k  =  /  be  the  expected  number 

of  targets  in  the  track-set  F^.  Then  from  Theorem  4  it  follows  that  the  expected  number  Nk+i\k  of 
time-extrapolated  targets  is 

Nk+Mk^  j  {dk^i\k{^)Nk\k  +  b^ik{^\Z^''^)dx 

where  j  j  ^  (x)  =  /  1  j  fc  (y  | x)  dy  is  the  expected  number  of  birth  targets . 

The  following  is  a  special  case  of  Theorem  4  that  employs  a  simple  state-dependent  Poisson  model 
to  account  for  the  appearance  of  new  targets. 

Corollary  1  (Law  of  Motion  for  PHD’s  With  Poisson  Births):  Suppose  that  between 

time-step  k  and  time-step  A:  -h  1,  the  following  multitarget  motion  model  is  assumed:  (1)  target 
motions  are  statistically  independent;  (2)  the  Markov  motion  model  for  single  targets  is  /fc-f.i|fc(y|x);  (3) 
the  probability  that  any  individual  target  will  not  ’’die”  is  dk^i\k\  and  (4)  each  state  x  at  time-step  k 


generates,  independently  of  all  other  targets,  new  ’’birth”  targets  in  a  Poisson-distributed  fashion  with 
Poisson  parameter  and  birth  distribution  &fc+i|fc(y|x).  Then: 


Dk+i\k{y\Z^’‘^)=  I  (4+i|fc/fc+i|fc(y|x)  +  Afc+  :6fe+i|fe(y|x))  I)fc|fc(x|z(''^)(ix 
The  proof  of  this  fact  follows  immediately  from  Theorem  4  by  noting  that 

*ifc+i|fc({y:  yn}|x)  =  e"^*-  i''A^^i|;i.6fc+i|fe(yi|x)  6fc+i|fc(yn|x)  i>fc+i|fc(0|x)  =  e 

and  so 

&i+i|fc(*5|xi)  =  V-r  /  &fc+i|fc({yi,  yn}|x)rfyi  dy„  = 
and  therefore 


ftfc+i|fc(y|xi)  = 


Sb: 


'fc+llfc 


(■5|xi) 


s=. 


=  Ajb+i|fc6fc+i|jb(y|xi 


Note  that  the  time-extrapolated  number  of  targets  is  =  (dfc+ijfc  +  A^+ii^)  We  conclude 

by  deriving  the  law  of  motion  for  PHD’s  whose  between-measurements  time-evolution  is  governed  by  the 
simplest  possible  multitarget  motion  model. 

Corollary  2  (Simplest  Law  of  Motion  for  PHD’s):  Suppose  that  between  time-step  k  and 
time-step  A:-|- 1,  the  following  multitarget  motion  model  is  assumed:  (1)  target  number  does  not  change; 
(2)  target  motions  are  statistically  independent;  and  (3)  the  Markov  transition  model  for  the  single-target 
motion  model  is  /fc+i|fc(y|x).  Then 

^fc+iifeCyl^^*"^)  =  j /fc+i|fc(y|x)  £)fc|fe(x|z(''))dx 

The  proof  of  this  fact  results  from  setting  =  1  targets  disappear)  and  =  0  (no 

targets  appear)  in  Corollary  1.  Stated  in  different  words:  Given  this  simple  mult iar get  motion  model, 
the  between-measurements  time-evolution  of  the  PHD  is  governed  by  the  same  law  of  motion  as  that 
which  governs  the  between-measurements  time-evolution  of  the  posterior  density  of  any  single  target  in 
the  multitarget  system. 

3.2  BAYES  INFORMATION-UPDATE  OF  THE  PHD 

In  the  single-target  case,  when  a  new  measurement  is  collected  this  information  can  be  incorporated 

into  the  time-extrapolated  posterior  /fc+ijfc(y|^*^)  using  Bayes’  rule  (the  second  of  the  single-target 
Bayes  filtering  equations  of  Section  1.1).  The  question  that  confronts  us  in  this  section  is  as  follows. 
Suppose  that  we  have  collected  a  new  multisensor-multitarget  observation-set  Zk-\-i-  Let  f{Z\X)  be 
the  multisensor-multitarget  likelihood  function  that  corresponds  to  some  multisensor-multitarget  sensor 
model  [11,  pp.  17-20]  and  let  the  Bayes-rule  update  of  the  time-predicted  multitarget  posterior  be 
computed  as  in  the  second  of  the  multitarget  Bayes  filtering  equations  of  Section  1.2.  Then  given  the 
corresponding  PHD’s 

Ofc+i|fc+i(x|z(*=+i))  =  [ /fc+i|fc+i({x}  UT^1ZW)5W  £>fc+i|fc(x|zW)  =  I  fk+nk{{^}  U  W\Z('^^)6W 


what  rule  will  allow  us  to  use  Zfc+i  to  update  to  get  As  it 

turns  out,  it  is  not  possible  to  construct  a  simple  recursive  update  for  PHD’s  that  faithfully  reflects  the 
Bayes  update  on  multitarget  posteriors. 

3.2.1  INFORMATION-UPDATE  USING  AN  APPROXIMATE  BAYES’  RULE.  We 

have  no  choice,  then,  but  to  adopt  an  approximate  Bayes  update  step.  One  possible  approximation  is 
suggested  by  turning  to  the  discrete  case  described  in  Section  2.5.  Assume  that  current  observations 
depend  only  on  the  current  condensed  state  x — i.e,  Pr(Zfc+i|x  G  =  Pr(Zfc4.i|x  € 

Then 


£>fe+i|fc+i(x|Z('=+i))  =  Pr(x  G  r|Z('=+'))  = 


t4+i 


where 

^  _  If{Z\{x}uW)fo{{x}UW)6W 

U[^\x)-  //o({x}uW^)5H^ 

The  likelihood  should  be  ’’biased’’  only  by  the  previous  PHD  Therefore  we  assume  that 

fo{X)  is  a  multitarget  uniform  density  u{X)  (see  Section  3.3  below). 

3.2.2  INFORMATION-UPDATE  USING  THE  APPROXIMATE  POSTERIOR  OF  A 
PHD.  The  work  described  in  this  paper  can  be  viewed  from  a  different  perspective  that,  for  lack  of 
space,  we  can  only  summarize  here.  (Details  will  appear  in  a  subsequent  paper.)  Let  fjc\k{X\Z^^^) 
be  a  multitarget  posterior,  Dfc|fc(x|Z^^^)  its  associated  PHD,  and  =  J  Dk\f^{x\Z^^^)dx,  We  want 

to  approximate  by  an  approximate  multitarget  posterior  —  fk\k  ^  covariance-free — i.e., 

whose  multitarget  moments  are 

Dk\kWz(’'^)  =  =  £>fc|fc(xj|Z(''))  Dfc|fc(x„|zW) 

where  X  =  {xi  ,Xn)  Theorem  3  of  Section  2.4  yields 
A|fc(x|zW)  =  y(-i)i^iDfe|fe(xuy|zW)5y 

OO  1  /» 

J2-I  (-l)'£>fe|fe(xa|zW)  •  •  •  Dfc|fc(x„|ZW)Dfc|;fc(yi|Z«)  Dfe|fc(y„|ZW)dy:  dy, 

i=sO  *' 

Dfc|fc(xi|Z«)  Dfc|fc(x„|Z('=))f^^^^^^  =  e-^''i'=L»fe|fc(xi|Z«)  Dfc|fc(x„|zW) 

i=0 

where  /fc|fc(0|ZW)  =  Because  A|fc  S /fci*.  we  can  propagate  fk\k  in  place  of  fk\k-  Given  this, 

Section  3.1  can  be  interpreted  in  a  different  light.  We  replace  siiiA  /fc+i|fc  by  fk\k  fk-hi\k 
determine  what  law  of  motion  Dk\h  corresponds  to  the  law  of  motion  fk\k  fk^i\k  specified 

by  the  multitarget  time-prediction  integral  of  Section  1.2.  Theorem  4  emerges  as  a  consequence. 

The  multitarget  Bayes’  rule  information-update  step  of  Section  1.2)  can  be  interpreted  in  a  similar 
manner.  That  is,  we  replace  fk+i\k  and  /fc+ijfc+i  by  their  approximations  fk+i\k  and  fk-^i\k^i  and 
then  determine  what  transformation  Dk+i\k  l-be  associated  PHD’s  corresponds  to  the 


transformation  fk^i\k  A+ilfc+i  specified  by  the  multitarget  Bayes’  rule.  Under  certain  assumptions 
it  is  possible  to  derive  formulas  for 

Theorem  5  For  example,  suppose  that  (1)  there  is  a  single  sensor  with  (single-target)  likelihood 
function  /(z|x);  (2)  target  observations  are  independent;  (3)  the  probability  of  missed  detection  pp  is 
state-independent;  (4)  <  (1  (5)  sensor  observations  are  corrupted  by  independent, 

state-independent,  Poisson  false  alarms  with  Poisson  parameter  A  and  distribution  c(z).  Let  Zk-\-i  = 
{zi,  ...,Zm}  denote  the  latest  scan  of  data.  Then  it  can  be  shown  that 


^  +  ^k+l{z\ZW)  ( 


(1  -  PD)Nk+i\k 


where 

I)fc+i|fc+i(x|z,z(*’))  =  A'-V(z|x)  Dfc+iifcCxlZ^*)),  Dfe+i(z|Z(''>)  j  f{^\y)Dk^i\kiy\Z^’'^)dy 

and  where  K  =  J /(z|y)I)fc^.i|;fe(y|Z(^))dy.  That  is,  the  Bayes-update  step  for  PHD’s  is  additive — a 
property  that  Stein  and  Winter  call  ”  Weak  Evidential  Accrual.” 

3.3  PRIOR  PHD’s 

Let  fo{X)  be  a  prior  multitarget  density  [11,  p.  37].  Then  we  can  construct  the  corresponding  prior 
PHD  using  the  definition  of  a  first  multitarget  moment  function  (Definition  1):  Do{x)  =  f  /o({x}  U 
W)6W.  Alternatively,  if  we  specify  a  prior  random  track-set  T  then  the  prior  PHD  can  be  constructed 
directly  from  T  using  Theorem  1. 

For  example,  suppose  that  we  have  n  independent  tracks  with  prior  densities  /i(x),  ,..,/n(x)  and 
that  the  i’th  track  is  believed  to  exist  with  probability  TTi.  Then  the  prior  track-set  is  r  =  riU...urn 
where  F^  =  {X^}  U  0^’ .  The  prior  belief-mass  function  is 

MS)  =  Pr(r  CS)  =  Pr(ri  C  5) .  -  •  Pr(r^  C  5)  =  (1  -  TTi  -h  7riPi(5))  . . .  (1  ~  TTn  +  ^nPn{S)) 

where  Pi{S)  =  /i(x)dx.  Using  Theorem  1  we  find  that  the  prior  PHD  is 


Do(x)  = 


6x 


(S) 


^  (1  -  TTi  +  7ripi(5))  •  •  •  7riPi(5)  ■  •  •  (1  -  7r„  +  7r„p„(5')) 

Lt=i  J  s^s 


=  7ri/i(x)  -h ...  -h7rn/n(x) 

The  prior  expected  number  of  target  is,  therefore,  Nq  =  f  Do{x)dx  =  tti  -1-  ...  +  7r-„.. 

As  another  example,  suppose  that  fo{X)  is  a  multitarget  uniform  distribution  [11,  p.  37], 


u{X) 


n!U-^(M-f  1)-^  if  XCS 
0  if  otherwise 


Then  for  x  G  S, 


r  ^  ^  1  r 

Do{x)  =  /  u({x}UVr)5VF=  ^  / 

Js  ^  Js< 


ii  +  l)\ 


V‘+i(M  + 


rfX]  •  dxi 


M 


V'(M+l)^-^  V{M  +  1)  2 


2V 


The  prior  expected  number  of  targets  is,  therefore,  Nq  =  Do{x)dx  = 


4.0  MATHEMATICAL  PROOFS 

4.1  PROOF  OF  THEOREM  1 

Let  Tk  be  a  finite  random  subset  of  state  space  S  and  let  Dk\k{X\Z^^^)  be  its  corresponding  multitarget 
moment  density.  We  are  to  prove  that 

Dfc|fc(X|zW)  = 

for  all  X  C  S,  where  =  Pr(rjfe  C  S)  is  the  belief-mass  function  of  Tk  Let  ^ 

suppose  we  knew  that 

for  any  measurable  S  C  S.  Then  since  W  D  X  =  0  almost  everywhere  it  would  follow  that 

=  J A|,(A'UVtlZ(*))5lF  =  y  =  4>(5)  =  ^(5) 

and  we  would  be  done.  So  let  us  prove  that  ^{S)  =  ^(0)^^  for  $  =  First,  by  Theorem  17  of 

[7,  p.  155]  we  know  that 


oo  „ 

l3k\k{S\Z^^^)  =  '^ai  /i(xi,  Xi)dxi  dx 


for  some  real  numbers  ak  and  where  fi  is  a  completely  symmetric  density  in  i  arguments.  Thus  if  we 
set 


it  follows  that 


f  /i(xi,  ,Xi)dxi  dxi 

Js' 

HS)  =  ^(5)  =  =  Eai3>i(-S) 

t=0 


i=0 


where  4>i  =  ^  for  all  z  >  0.  It  is  enough  to  prove  that  ^i{S)  =  |^(0)^^  for  alH  >  0  since  it  would 

then  follow  that 


as  desired.  To  show  that  4>i(S')  =  ^  |F|  =  j  and  recall  from 

Proposition  19  of  [7,  p.  159]  that 


MY  =  XUWwithX  =  {xi 

if  \W\  and 

otherwise.  Accordingly, 


x^i},  W  —  {wj  and  X  r\W  =  ^  then 


8W^  '  6{XUW) 


(0)  =  O 


(54>i 

(®)  /i(^l  XtxjWi  Wi 


1  f  5‘1>, 

(f^  L-  i{w„.,.,w, 

I  f  _  w 

;*-”)•'  /s 


{0)'/W;  rfWi_, 


Js'-^  Mwi,...,w._„.x,,...,x„} 

J  /i(Wi,...,W,_„,Xi,...,X„)(/W  dWi- 


On  the  other  hand. 


~ 'Sc  {^Js  Wj)dwi  dwij 

^  /  /i(wi  [wi]j,  (w„]j  Wi)-dwi  [dwjj  [dw,- 


dw- 


where  the  last  summation  is  taken  over  all  distinct  ji,  with  1  <  ji,...,ji  <  i,  where  [w]_,  indicates 
that  the  ar^ment  Wj  has  b^n  replaced  by  x,  and  where  [dWj]  indicates  that  the  differential  dwj  is 
excluded.  Since  fi  is  symmetric  and  since  there  are  terms  in  the  summation  we  then  get 

^  7j.)!  Jsi  .Wi_„,xi  x„)dwi  dwi_„ 

Thus  ^i{S)  =  fg  (0)511^  for  alH  >  0  and  we  are  done, 

4.2  PROOF  OF  THEOREM  2 

We  are  to  prove  the  following:  for  any  S  CS 


E[|rfcnS’|]=  f  I>ifc|fc({x}|zW)dx 

Js 

Consequently,  the  first  multitarget  moment  density  and  the  Probability  Hypothesis  Density  (PHD)  are 
equal  alm^t  everyhere:  Dfc|fc(x|ZW)  =  Dfc|fc({x}|ZW).  Let  5w(x)  =  ^x(w)  and  ^^(x)  =  0 

where  Ox(w)  is  the  Dirac  delta  concentrated  on  x.  First  note  that 


f  Sw{x)f^k{W\Z('^^)6W  =  E  ^  /  (5x(w 


+  +^x(wi))/fc|fc({wi  w<}|Z(*))dwi  dsv^ 


1  r 

j=0 

I  A|fc({x}  u  F|Z(^))5r  =  Z)fc|,({x}|Z('=)) 

Now  define  the  indicator  function  l5(x)  by  l£r(x)  =  1  if  x  G  5  and  ls{^)  =0  otherwise.  Then 


jT  £>fc|fc({x}|Z(''))dx  =  J  l5(x)  (^J  dx 

I  (^1  ls(x)5vv'(x)dx^  /fc|fc(Vl'|Z(*'))5ll' 

J  \S  n  w\  /fc|fc(vi-|z(''>)5iv  =  E[\rk  n  5|] 

4.3  PROOF  OF  THEOREM  4 

We  are  to  show: 

Ok+i\k{y\^^''^)  =  J  (4-i|fc(x)/fc+i|fc(y|x) +  fcA,+i|*(y|x)^  f4|A.(x|Z^*))rfx 


First,  notice  that  by  Definition  1  (Section  2.1), 

Dk+i\k{y\Z^'‘^)  =  J  fk+i\k{{y}  U  \V\Z^'^^)6W  =  J  (^J  ,({y}  U  W\X)  A|fc(A'|Z(^>)<!ix)  6W 

1(1 4*1  fc({y}  u  ir|A)w)  A|fc(A'|Z<*’))5A' =  ^  4+i|;t(y|A')  4|fc(A|Z(''))5A 

where 

£4+i|fc(y|^)  =  J  4^iifc({y}uH'|A)^iA 

Suppose  that  we  know  that,  given  X  =  {Xl,  .,x<}, 

i 

Dfc+i|fc(y|^)  = 

j=l 


then  we  will  have 

bk+i\k{y\z^’^^)  J  bk+i\k{y\x)  Mx\z^’^^)sx 


/^(4+i|fc(xj)/fc+i|fc(y|xj)  +  6fe+i|fc(y|xA)  /fe|fc({xi,...,Xi}|Z(*'))dxi  dx 


J  (4+i|fc{x)/fc+i|fc(y|x)  +Sfc+I|fc(y|x)) 


(2  jiy/fc|fc({x,wi,  Wj}|Z(*=))dwi  dwjjdx 

j  (4+i|fe(x)/fc+i|fc(y|x)  +  4+i|fc(y|x))  £>fc+i|fc(y|^^^^)dx 

as  claimed.  We  must  therefore  show  that 

n 

Dk+i\k{y\X)  =  ^  (4+i|fc(xi)/fc+i|fc(y|xi)  4-  4+i|fc(y|xi)j 


Let  =  $fe(x,  W*^)  be  the  single-target  motion  model  that  corresponds  to  the  single-target  Markov 

density /fc+i|fc(y|x),  let  X  =  {xi,  ...,x„}  be  the  target  states  at  time-step  A:,  let  W^, be  i.i.d. 

copies  of  the  random  noise  vector  W*,  and  let  =  $/-(xi,  Wf).  Define  Ff  =  {Xf'*'^}  D  0^  ' 

where  0^''+'!''^*^  is  a  random  subset  of  state  space  5  such  that  0^’'+*"'^’'^  =  0  with  probability  1  - 
4+i|fc(x)  and  0^"+*'''  =  5  with  probability  4+i|fc(x).  We  also  assume  that 

are  i.i.d.  and  that  are  statistically  independent.  Then  Ff  =  0  with 

probability  l-4+i|fc(x)  and  Ff  =  {Xf+^}  with  probability  4+i|fc(x).  Furthermore,  by  assumption 
each  target  with  state  x  at  time-step  k  generates,  independently  of  all  other  targets,  new  ’’birth” 
targets  at  time-step  A: -I- 1  in  a  manner  governed  by  a  multitarget  density  6fc^.i|fc(X|x).  Translated  into 
mathematical  terms  the  multitarget  motion  model  is,  therefore, 

Ffc+i=F^L  UFjjUBfu  UB* 


md  the  corresponding  belief-mass  function  is 

l3k+i\k{S\X)  =  Pr(F?U...UF^UBfu...UB^C5) 

=  Pr(Fj  C  S)  •  •  •  Pr(F^  C  S)  Pr(B{'  C  S)  •  ■  ■  Pr(B*  C  S) 

=  (l  -  4+l|fc(Xl)  +  4-(-llfe(Xl)Pfc+l|fc(-SlXl))  •  •  •  (l  -  4+l|fe(Xn)  -t-  dk+l\k{^)Pk+l\k{S\Xn)) 
•4+iifc('91xi)  •  •  •  4+i|fc('S'lx„) 

where  6fc+i|fc(S|x)  =  /6fc+i|fc(X|x)5X  is  the  belief-mass  function  corresponding  to  the  multitarget 

density  function  fk+i\k{X\Z^’°^).  The  first-order  set  derivative  is 

4^ijfc  (i  —  4+i|fc(xi)  +  4+i|fc(xi)pfc+i|fc(>^|xi)) 

^  i=l 

—  (l  —  dfc4.ijfc(xi)  +  dfc+i|fc(xi)pfc+i|fc(5|xi)) 

(l  —  dfc_|_i|fc(Xn)  +  dfc+ijfc(Xn)Pfc+l!fe(^lXn)) 

•^4+llfc(’^|Xl)  •  •  •  ^4+llfc('S'|Xi)  •  •  •4+l|fc(^|Xn) 
i=l  ^ 

n 

(l  —  rffc+l|fc(Xl)  +  C^fc-|-l|fc(Xl)Pfc+l|fc(*^|^l))  <^fc-fl|fc(Xi)/fc+l|fc(y|^») 


{i  -  4+i|fc(x7i)  +  4+i|fc(x„)p/,-+i|*(‘^|x„)) 

X]  ^k+i\k{S\-s.i) - t^(5’|xi)  •  •  •  6fc+i|fc(5|x„) 

t=l  ^ 

Setting  5  =  and  using  the  fact  that  i^fc+i|fc(*S|xi)  =  1  for  all  i  we  get 


^fc+i|fc(y|^)  = 


(5|X)  =  X  (  1 1  A;  (^i ) /fc+1 1  (y  l^i ) 

i=l 


as  desired 


5.0  CONCLUSIONS 

In  this  paper  I  have  used  finite-set  statistics  (FISST)  to  provide  a  systematic  and  theoretically  rigorous 
theoretical  framework  for  the  Stein-Tenney  Probability  Hypothesis  Density  (PHD)  multitarget  tracking 
approach.  After  framing  the  optimal  multitarget  tracking  problem  as  a  multitarget  recursive  Bayes 
filtering  problem,  I  showed  that  the  PHD  is  actually  a  first-order  statistical  moment  of  the  time-evolving 
random  track-set  F^.  Consequently,  the  PHD  approach  can  be  interpreted  as  a  multitarget  analog  of 
a  constant-gain  Kalman  filter  (e.g.,  the  a-/?-7  filter).  I  showed  how  PHD’s  can  be  computed  directly 
using  the  ’’set  derivative.”  I  also  showed  how  the  conventional  single-sensor,  single-target  recursive  Bayes 
filtering  equations  can  be  generalized  to  the  PHD  case  (though,  in  the  case  of  the  Bayes  update  step,  this 
generalization  can  be  only  approximate). 

Given  this,  from  a  computational  point  of  view  real-time  optimal  multitarget  tracking  is  reduced 
to  the  problem  of  implementing  a  real-time  single-target  nonlinear  filter  that  is  capable  of  modeling  the 
rather  complex  time- evolution  of  the  PHD.  This  is  in  itself  a  difficult  research  problem  that  will  not 
be  successfully  addressed  by  thirty-year-old  techniques  copied  from  textbooks.  Rather,  it  will  require 
advanced  techniques  currently  under  development  by  a  number  of  researchers.  See  [11,  pp.  5-6,  15-16] 
for  a  brief  discussion  of  the  major  computational  issues  and  some  of  the  major  computational  strategies.) 

The  approach  outlined  here  can,  in  principle,  be  generalized  to  develop  a  filtering  approach  based 
on  the  second-order  multitarget  moment  densities  Dfcjfc({xi, X2}|^^^^).  Since  second  moments  can  be 
reformulated  to  have  the  general  form  of  a  covariance,  it  is  aso  possible  in  principle  to  develop  a  statistical 
multitarget  analog  of  the  Kalman  filter. 
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